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ABSTRACT 

Osteoarthritis (OA), the most common form of arthritis, 
is a highly debilitating disease of the joints and can lead 
to severe pain and disability. There is no cure for OA. 
Current treatments often fail to alleviate its symptoms 
leading to an increased demand for joint replacement 
surgery. Previous epidemiological and genetic research 
has established that OA is a multifactorial disease with 
both environmental and genetic components. Over the 
past 6 years, a candidate gene study and several 
genome-wide association scans (GWAS) in populations 
of Asian and European descent have collectively 
established 15 loci associated with knee or hip OA that 
have been replicated with genome-wide significance, 
shedding some light on the aetiogenesis of the disease. 
All OA associated variants to date are common in 
frequency and appear to confer moderate to small effect 
sizes. Some of the associated variants are found within 
or near genes with clear roles in OA pathogenesis, 
whereas others point to unsuspected, less characterised 
pathways. These studies have also provided further 
evidence in support of the existence of ethnic, sex, and 
joint specific effects in OA and have highlighted the 
importance of expanded and more homogeneous 
phenotype definitions in genetic studies of OA. 
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INTRODUCTION 

Osteoarthritis (OA) is a set of disorders of the mus- 
culoskeletal system characterised by degradation 
and loss of articular cartilage in synovial joints 
most commonly of the knee, hip, hand, foot, and 
spine. OA development appears to be a result of a 
complex set of interactions between mechanical, 
biological, biochemical, and molecular factors that 
destabilise the normal coupling of degradation and 
synthesis of articular cartilage chondrocytes and 
extracellular matrix, and subchondral bone. 
Although OA invariably involves articular cartilage, 
it affects all tissues of the joint; loss of articular car- 
tilage is accompanied by subchondral bone remod- 
elling with sclerosis and in many instances cysts, 
osteophyte formation at joint margins, ligamentous 
contractures and relaxation, muscle atrophy and 
spasms, and at clinical stages of the disease inflam- 
mation of the synovial membrane. 1 2 

The health and socioeconomic burden posed by 
OA is substantial. The main symptom of OA is pain 
and loss of physical function leading to impaired 
mobility and impaired quality of life. 3 Current regi- 
mens for OA management are multimodal in 
nature — that is, a combination of pharmacologic 
and non-pharmacologic treatments. 2 However, 
these are often ineffective in targeting the main 
disease symptom leading to an increased demand 
for total joint replacement (TJR). 4 OA is the most 
prevalent form of arthritis affecting over 40% of 
people over the age of 70, 5 and its incidence is on 



the rise. In the USA alone 27 million adults had 
clinical evidence of OA in 2005, a rise of nearly 
30% from the estimate of 21 million in 1995. 6 7 
With longer life expectancies and the obesity pan- 
demic — with age and obesity/overweight being well 
established risk factors for disease development and 
progression — the prevalence of OA is expected to 
increase continuously and sharply. 

Although the aetiology of OA is not fully under- 
stood it has been well established that the disease is 
caused by complex interplay between environmen- 
tal and genetic factors. Age is the strongest risk 
factor for all types of OA whereas obesity appears 
to confer the greatest risk in knee OA, particularly 
among women. Epidemiological research also sug- 
gests that occupational physical workload, high 
sporting activity, joint injuries and being female 
may increase the risk of developing OA at particu- 
lar joints (reviewed in Altman, 2 and Bierma- 
Zeinstra and Koes 8 ). 

GENETIC STUDIES IN OA 

The pre-genome-wide association scans era 

Twin pair, sibling risk and segregation studies con- 
ducted in Europe and the USA have demonstrated 
a substantial genetic component for OA that is 
transmitted in a non-Mendelian manner, which is 
typical of multifactorial diseases. Heritability esti- 
mates range between 40-65%, with precise esti- 
mates varying depending on gender, affected joint, 
and severity of the disease, but overall appear 
stronger for hand and hip OA than for knee 
OA. 9 10 Familial aggregation studies in the UK have 
estimated that the sibling recurrence risk (k s ) — 
which indicates the disease risk of a sibling to an 
individual with OA compared to the disease preva- 
lence in the general population — is ~5. 10 The 
notion that OA is simply a wear-and-tear disease of 
old age was largely superseded and these epidemio- 
logical studies provided a firm foundation for con- 
siderable genetic research aimed at identifying 
genetic loci responsible for OA susceptibility. 

To date five genome-wide linkage scans per- 
formed on individuals collected in the UK, Finland, 
Iceland, and the USA have been published for OA 
but had limited success. 10 Gene centric association 
studies, commonly known as candidate gene 
studies, have been extensively applied in popula- 
tions of European and Asian ancestry to survey var- 
iants across genes believed to be implicated in OA 
based on prior biological knowledge. The majority 
of reported associations, however, have been either 
false positives — due to small sample sizes, lack of 
replication and lack of stringency in the reporting 
of significant results based on observed p values — 
or have yielded only suggestive evidence for associ- 
ation; that is, replication in at least one other study 
but not meeting genome-wide significance defined 
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as p<5xl0 -8 (for examples, see Valdes and Spector 10 ). A 
notable exception of the success of the candidate gene sequen- 
cing approach in OA is the robust and reproducible association 
of rsl43383 in the growth differentiation factor 5 (GDFS) 
gene, 11-13 discussed in more detail below. 

The genome-wide association scans era 

In the last decade, the Human Genome and International 
HapMap Projects have revolutionised the field of common 
complex disease genetics by providing an extensive catalogue of 
genome sequence variation and linkage disequilibrium (LD) pat- 
terns between common variants. This has enabled the selection 
of tag single nucleotide polymorphisms (tag SNPs) — a set of 
informative, non-redundant markers capturing the majority of 
common variations across the genome — which led to the devel- 
opment of high throughput genotyping platforms in which hun- 
dreds of thousands of SNPs can be concurrently examined for 
association with disease. In recent years, this hypothesis-free 
approach of interrogating common variation in a genome-wide 
manner dominated the field of human genetics and led to the 
identification of numerous novel associations with several 
common complex diseases and traits. 14 OA was relatively late to 
enter the genome-wide association scans (GWAS) era but the 
returns were substantial; two novel associations from studies in 
individuals of Asian origin, 15 16 and 12 novel associations from 
scans performed in individuals of European ancestry, 17-22 were 
detected with genome-wide significance bringing the total of 
established OA loci to 15 (table 1). This review considers all 
associations with OA that have surpassed or have approached 
closely the stringent threshold of genome-wide significance fol- 
lowing replication in at least one independent dataset. Extensive 



fine mapping and functional studies are required to identify the 
causal variants and precise genes involved in OA pathogenesis. 

GENETIC ARCHITECTURE OF OA 

In line with other common complex disorders the genetic archi- 
tecture of OA appears to be highly polygenic with multiple var- 
iants across the full allele frequency spectrum contributing 
modest and small effects. The theory of a polygenic inheritance 
model for OA was first tested by the arcOGEN Consortium in a 
GWAS of 3177 cases and 4984 population based controls from 
the UK. 23 Using analytical approaches previously applied to test 
the polygenic inheritance of schizophrenia and bipolar dis- 
order, 24 a set of independent associated SNPs was derived from 
a subset of the data (90% of arcOGEN samples); this score 
allele set was then used to evaluate the proportion of case- 
control status accounted for in the remaining samples (10% of 
arcOGEN samples). These analyses revealed a substantial 
genetic component to OA comprising multiple contributing var- 
iants with small effect sizes. 

OA ESTABLISHED LOCI 
GDF5 

In the only candidate gene study that yielded a strong reprodu- 
cible association with OA thus far, Miyamoto et al, 11 searched 
for sequence variations in the exons and flanking regions of the 
GDF5 gene and identified the rsl43383 polymorphism — a T to 
C transition located in the 5' untranslated region (5'UTR) of the 
gene — to be significantly associated with hip OA. Combined evi- 
dence for association in two independent Japanese populations 
reached genome-wide significance with p=2xl0 -13 and allelic 
odds ratio (OR) of 1.79 (95% CI 1.53 to 2.09) (table 1). 



Table 1 Genetic associations with osteoarthritis established with genome-wide significance following replication in at least one independent 
dataset 



SNP 


Nearest* gene(s) 


EA 


EAF 


OR, 95% CI 


p Value 


Site 


Sex 


Ethnic group 


Source 


rs143383t 


GDF5 


T 


0.74 


1.79,1.53 to 2.09 


2x10~ 13 


Hip 


Both 


Asian 


11 


rs143383t 


GDF5 


T 


NA 


1.16, 1.11 to 1.22 


8.3x1 0- 09 


Knee 


Both 


European 


13 


rs7639618 


DVWA 


G 


0.63 


1.43, 1.28 to 1.59 


7.3x10~ 11 


Knee 


Both 


Asian 


15 


rs7775228t 


HLA-DQB1 


T 


0.62 


1.34, 1.21 to 1.49 


2.4x1 0~ 08 


Knee 


Both 


Asian 


16 


rs1 0947262* 


BTNL2 


C 


0.58 


1.31, 1.20 to 1.44 


5.1 x10- 09 


Knee 


Both 


Asian and European 


16 


rs3815148§ 


C0G5H 


C 


0.23 


1.14, 1.09 to 1.19 


8x1 0~ 08 


Knee and hand 


Both 


European 


17 


rs4730250§ 


DUS4LM 


G 


0.17 


1.17, 1.11 to 1.24 


9.2x1 0- 9 


Knee 


Both 


European 


18 


rs1 1842874 


MCF2L 


A 


0.93 


1.17, 1.11 to 1.23 


2.1 x10- 08 


Knee and hip 


Both 


European 


19 


rs6976** 


GLT8D1U 


T 


0.37 


1.12, 1.08 to 1.16 


7.2x10~ 11 


Hip and knee 


Both 


European 


20 


rs11177** 


GNL3U 


A 


0.38 


1.12, 1.08 to 1.16 


1.3x10~ 10 


Hip and knee 


Both 


European 


20 


rs4836732 


ASTN2 


C 


0.47 


1.2, 1.13 to 1.27 


6.1x10~ 10 


Hip 


Females 


European 


20 


rs9350591 


FILIP1; SENP6 


T 


0.11 


1.18, 1.12 to 1.25 


2.4x1 0- 09 


Hip 


Both 


European 


20 


rs1 0492367 


KLHDC5] PTHLH 


T 


0.19 


1.14, 1.09 to 1.20 


1.5x10- 08 


Hip 


Both 


European 


20 


rs835487 


CHST11 


G 


0.34 


1.13, 1.09 to 1.18 


1.6x10- 08 


Hip 


Both 


European 


20 


rs1 21 07036 


TP63 


G 


0.52 


1.21, 1.13 to 1.29 


6.7x1 0- 08 


Knee 


Females 


European 


20 


rs8044769tt 


FTO 


C 


0.5 


1.11, 1.07 to 1.15 


6.9x1 0" 08 


Hip and knee 


Females 


European 


20 


rs1 09481 72 


SUPT3H; CDC5L 


G 


0.29 


1.14, 1.09 to 1.20 


7.9x1 0~ 08 


Hip and knee 


Males 


European 


20 


rs6094710 


NC0A3 


A 


0.04 


1.28, 1.18 to 1.39 


7.9x1 0~ 9 


Hip 


Both 


European 


22 


rs1 2982744 


D0T1L 


C 


NA 


1.17, 1.11 to 1.23 


7.8x1 0~ 9 


Hip 


Males 


European 


21 



* Nearest gene(s) only shown. 

tSummary statistics of the same SNP in separate studies in Asians and Europeans, respectively. 
tSNPs in strong linkage disequilibrium. 
§SNPS in strong linkage disequilibrium. 

Hchr7q22 locus encompasses more genes than shown here, for full details see Kerkhof et a/ 17 and Day-Williams et a/. 19 
**SNPs in strong linkage disequilibrium. 

ttchr3p21.1 locus encompasses more genes than shown here, for full details see arcOGEN Consortium. 20 

ttThis signal was attenuated after BMI adjustment, suggesting that the FTO locus exerts its effect on OA through obesity. 

BMI, body mass index; EA, Effect allele; EAF, effect allele frequency; OA, osteoarthritis; SNP, single nucleotide polymorphism. 
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Evidence for association with knee OA was weaker in datasets 
from China (p = 3xl0~ 4 ) and Japan (p = 0.002). n A large scale 
meta-analysis employing 4791 hip OA cases and 6006 controls, 
and 4367 knee OA cases and 6291 controls, 12 showed that in 
samples of European descent there was less compelling evidence 
for association with hip OA (OR=1.07, 95% CI 1.01 to 1.14; 
p = 0.034) and more compelling evidence for association with 
knee OA (OR=1.13, 95% CI 1.06 to 1.20; p = 9xl0~ 5 ), but 
with a much weaker effect size than the East Asian set. These 
differences can be ascribed to allele frequency disparities 
between ethnic groups. The GDFS SNP was eventually found to 
be genome-wide significantly associated with knee OA in 
Europeans in a subsequent meta-analysis across a total of 6861 
knee OA cases and 10 103 controls (table l). 13 Genetic vari- 
ation in the GDF5 locus has also been robustly associated with 
height variation, 25 and linked with suggestive significance to 
lumbar disc degeneration, 26 fracture risk, 27 congenital disloca- 
tion of the hip, 28 29 and Achilles tendon pathology, 30 suggesting 
a pleiotropic effect from this gene. 

GDF5, alternatively known as cartilage derived morpho- 
genetic protein 1 or bone morphogenetic protein 14, is an 
extracellular signalling molecule, a member of the transforming 
growth factor (TGF-p) superfamily. Differential allelic expres- 
sion analysis supported a functional role of the rsl43383 poly- 
morphism with the risk allele (T) mediating reduced GDF5 
transcription relative to the C allele. 11 31 Further studies in joint 
tissues (cartilage, synovium, meniscus, ligament, tendon, and fat 
pad) obtained from individuals undergoing elective joint 
replacement for OA demonstrated a consistent allelic expression 
imbalance in all tissues tested, implying that the functional 
effect mediated by rsl43383 on GDF5 expression is joint- 
wide. 32 The role of GDFS in the development, maintenance, 
and repair of bone, cartilage, and other tissues of the synovial 
joint has been extensively reviewed. 33-37 Mutations in the 
GDF5 gene have been previously implicated in a broad spec- 
trum of skeletal disorders in humans (for an overview see 
Cornells et al 33 ) and mice. 38 ^ 14 



HLA class ll/lll locus 

A GWAS and replication study across —4800 Japanese indivi- 
duals identified two strongly correlated variants in a region con- 
taining human leucocyte antigen (HLA) class ll/lll genes that 
were significantly associated with susceptibility to knee OA 
(p=2.43xl0~ 8 , OR=1.34, 95% CI 1.21 to 1.49 for 
rs7775228; p=6.73xl0~ 8 , OR=1.32, 95% 1.19 to 1.46 for 
rsl0947262) (table l). 16 Only rsl0947262 replicated in two 
European populations with combined estimates of OR=1.31 
(95% CI 1.20 to 1.44) and p=5.10xl0" 9 . Thus far, these asso- 
ciations have not be generalised to other Asian or European 
populations as these variants failed to replicate in a population 
of Han Chinese and in a large scale European meta-analysis, 
from which it appears that they do not tag the same HLA class 
II haplotype as they do in Japanese individuals. 47 48 

These associations nevertheless strengthen the evidence that 
immunologic mechanisms are implicated in the aetiology of OA. 
The two SNPs, rs7775228 and rsl0947262, are located 
between the upstream region of HLA-DQA2 and HLA-DQB1 
and within intron 1 of BTNL2 respectively, but it should be 
noted that the HLA region is characterised by extensive linkage 
disequilibrium making it very difficult to pinpoint the precise 
genes. HLA class II molecules are expressed in antigen present- 
ing cells and have a central role in the immune system by pre- 
senting peptides derived from extracellular proteins. HLA class I 
and class II genes code for proteins that are highly polymorphic 
and have been implicated in the susceptibility to many disor- 
ders, including arthropathies such as rheumatoid arthritis. 49 
BTNL2 encodes butyrophilin-like 2 which is thought to regulate 
Tcell activation. 50 Activated T cells and Thl cytokine transcripts 
are present in chronic joint lesions of OA patients, suggesting 
that T cells could be contributing to chronic inflammation. 51 52 
Interaction between T cells and chondrocytes through cell 
surface molecules such as HLA, CD4 or CD8 has been shown 
in OA. 51 Peripheral blood T cells from OA patients compared to 
normal donors show significantly higher proliferative responses 
to autologous chondrocytes. 52 



DVWA 

Through a GWAS interrogating -100 000 SNPs, Miyamoto 
et al 15 identified a previously unknown gene on chromosome 
3p24.3, DVWA, to be associated with susceptibility to knee 
osteoarthritis in Japanese individuals. The association of 
rs7639618 — a missense SNP — was replicated in additional 
Japanese and Han Chinese cohorts with p = 7.3xl0 -11 and 
OR=1.43 (95% CI 1.28 to 1.59) (table 1). 

DVWA encodes a 276 amino-acid protein with two regions 
corresponding to the von Willebrand factor type A domain 
(VWA domain). DVWl expression studies in various human 
tissues revealed highest expression in cartilage tissues from both 
controls and individuals with OA suggesting that DVWA func- 
tion is associated with cartilage. 15 Mutations in the VWA 
domains of a different gene (MATN3) have been previously 
associated with hand OA in an Icelandic linkage scan, 45 and 
with multiple epiphyseal dysplasia. 46 DVWA protein binds to 
p-tubulin, and the binding is weakened when the risk alleles at 
two highly associated missense SNPs (allele G at rs7639618 and 
allele Tat rsll718863, both located in the VWA domain) form 
a haplotype (Tyrl69-Cys260) that was found to be overrepre- 
sented in OA cases of the discovery GWAS. 15 These findings led 
to speculation that DVWA supports intracellular transport and 
affects OA susceptibility by modulating the chondrogenic func- 
tion of p-tubulin. 



Chr7q22 locus 

The first novel locus for OA that reached genome-wide signifi- 
cance in Europeans was reported by a GWAS in Dutch indivi- 
duals in a gene dense region on chromosome 7q22. 17 Following 
large scale replication, allele C at rs3815148 was found to be 
associated with knee and/or hand OA with p = 8xl0 -8 and 
OR=1.14 (95% CI 1.09 to 1.19) (table 1). The association with 
knee OA was further corroborated and reinforced by a 
meta-analysis across four other GWAS (deCODE, Rotterdam, 
Framingham, TwinsUK) performed under the auspices of the 
TreatOA Consortium. 18 

The chr7q22 locus harbours six genes, PRKAR2B, HPB1, 
COG5, GPR22, DUS4L, and BCAP29, within a large linkage 
disequilibrium block making it difficult to pin down the culprit 
gene. Since the GPR22 gene encodes a G-protein coupled recep- 
tor which is an attractive, potential drug target, this gene was 
taken forward for functional analysis. Immunohistochemistry 
experiments showed that the GPR22 protein was present in car- 
tilage and osteophytes in OA mouse models but absent from 
normal cartilage, providing some indication that GPR22 could 
be the causal gene. 17 Further gene expression studies using joint 
tissues from OA patients and control cartilage from patients 
who had neck of the femur fractures found significantly lower 
expression levels in OA cartilage compared with control cartil- 
age for five genes in the region — the exception being GPR22, 
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which was not detected. 53 Carriers of the OA risk allele showed 
a significant reduction in expression of HBP1 (HMG-box tran- 
scription factor which encodes a transcriptional repressor) in 
cartilage and synovium and of DUS4L (dihydrouridine synthase 
4-Like) in fat pad. 

MCF2L 

Using 1000 Genomes Project based imputation in a GWAS for 
OA by the arcOGEN Consortium (3177 OA cases and 4894 
controls), UK scientists were able to establish the third novel 
locus for OA in Europeans and the first common complex 
disease locus to be identified via 1000G imputation. 19 
Following large scale replication, rsl 1842874 in intron 4 of 
MCF2L (MCF.2 cell line derived transforming sequence-like, 
encoding the rho-specific guanine nucleotide exchange factor) 
reached genome-wide significance with p=2.1xl0 -8 and 
OR=1.17 (95% CI 1.11 to 1.23) (table 1). 

MCF2L has been implicated in both skeletal and pain related 
outcomes of OA. Mcf2l rat models of OA have shown that the 
protein is expressed in articular chondrocytes. 54 55 Another 
study in Mcf2l rat models found that its expression was highest 
in 5 -week-old rat brain sections, and could be localised to neu- 
rones and a-tanycytes — bipolar cells in the hypothalamus bridg- 
ing the cerebrospinal fluid to the portal capillaries — suggesting 
that Ost may participate in axonal transport in these specialised 
cells. 56 In zebrafish, mcf2l was dynamically expressed in a range 
of cell types during development, including Kupffer's vesicle. 57 
Diffuse expression of mcf2l was observed in the zebrafish brain 
throughout development — consistent with the strong expression 
seen in the brain in rat — and in the developing zebrafish jaw car- 
tilages, which suggests that mcf2l could also have a function in 
cartilage development. In human cells, mcf2l has been shown to 
regulate neurotrophin-3 -induced cell migration in Schwann 
cells. 58 Neurotrophin-3 is a member of the nerve growth factor 
(NGF) family. Knee OA patients treated with a humanised 
monoclonal antibody that inhibits NGF have shown improve- 
ments in both joint function and pain outcomes. 59 60 

Chr3p21.1 locus 

The following eight loci were discovered by the largest single 
GWAS for knee and/or hip OA to date, performed by the 
arcOGEN Consortium in 7410 cases and 11 009 population 
based controls from the UK, and confirmed in replication 
efforts including up to 7473 cases and 42 938 controls of 
European descent. 20 

Two perfectly correlated SNPs in chr3p21.1 situated in an 
extended LD block comprising over 30 genes were associated 
with OA, and association was stronger in patients ascertained by 
the more homogeneous criterion of TJR compared to a mixture 
of TJR and radiographically defined cases (ROA). rs6976 
(p=7.24xl0- n , OR=1.12, 95% CI 1.08 to 1.16) is situated in 
the 3' UTR of the GLT8D1 (glycosyltransf erase 8 domain con- 
taining 1) gene, and rslll77 (p = 1.25 x 10~ 10 , OR=1.12, 95% 
CI 1.08 to 1.16) is a missense polymorphism within exon 3 of 
GNL3 (guanine nucleotide binding protein-like 3, or nucleoste- 
min) (table 1). GNL3 is expressed in mesenchymal stem cells, 
from which chondrocytes are derived, and regulates the Gl-S 
phase transition in stem cells. 61-63 In cultured chondrocytes 
from patients with OA as compared with control subjects, 
nucleostemin protein values were substantially higher, suggest- 
ing that this gene may be functionally important in the patho- 
genesis of OA. 20 However, because of the large number of other 
genes in the same LD block, substantial follow-up work is 
required to identify the culprit gene. 



ASTN2 

rs4836732 located within intron 18 of the ASTN2 gene was 
found to be most highly associated with female total hip 
replacement (THR) (p=6.11 x 10" 10 , OR=1.20, 95% CI 1.13 
to 1.27) (table l). 20 ASTN2 (astrotactin 2) is a membrane 
protein that regulates surface levels of ASTN1 during neuronal 
migration 64 and is highly expressed in the developing and adult 
brain. An intronic SNP within ASTN2 has been shown to have 
some evidence of involvement with the pathogenesis of adult 
attention deficit hyperactivity disorder (ADHD). 65 In rare CNV 
analysis it has been shown that exonic deletion and duplication 
in the ASTN2 locus is associated with schizophrenia. 66 

FILIP1; SENP6 

rs9350591 was found to be significantly associated with hip OA 
(p=2.42xl0~ 9 , OR=1.18, 95% CI 1.12 to 1.25) (table l). 20 
This variant is located 38 kb upstream of FILIP1 (filamin A inter- 
acting protein 1) and 70 kb upstream of SENP6 (sentrin specific 
peptidase 6). The role of these poorly characterised genes in OA 
has not been explored yet. However, COL12A1 (collagen, type 
XII, al) is found ~326 kb away from the index SNP Type XII 
collagen is expressed by osteoblasts and localises to the perios- 
teum — an active area of bone formation. Col 12a-/- null mice 
exhibit several skeletal abnormalities and alterations in the organ- 
isation and polarisation of osteoblasts, suggesting a role for type 
XII collagen in osteoblast differentiation and bone matrix 
formation. 67 

KLHDC5; PTHLH 

rsl 04923 67 reached genome-wide significance in the hip OA 
analysis (p=1.48xl0~ 8 , OR=1.14, 95% CI 1.09 to 1.20) 
(table l). 20 This SNP is situated 59 kb downstream of KLHDCS 
(kelch domain containing 5) and 96 kb downstream of PTHLH 
(parathyroid hormone-like hormone). PTHLH presents an 
excellent candidate gene for OA as this hormone is known to 
regulate endochondral ossification (ie, bone development) by 
inhibiting chondrocytes from hypertrophy (reviewed in Zhang 
et al 68 ). Parathyroid hormone related peptide expression is 
higher in chondrocytes from pathologic articular cartilage than 
from normal cartilage of humans. 69 Pthrp-/- mice that survived 
gestation have accelerated differentiation of chondrocytes in 
bone. 70 

CHST11 

rs835487 within intron 2 of CHST11 was found to be most sig- 
nificantly associated with THR (p=1.64x 10" 8 , OR=1.13, 95% 
CI 1.09 to 1.18) (tablel). 20 CHST11 (carbohydrate sulfotrans- 
ferase 11), also known as chondroitin-4-sulfotransferase-l 
(C4ST-1), encodes an enzyme specific for the transfer of sulfate 
groups to the 4-0 position in chondroitin, with chondroitin sul- 
phate (CS) being an important component of cartilage proteo- 
glycans. CHST11 has significantly higher expression in 
osteoarthritic compared to normal articular cartilage. 71 Proper 
4-0 sulfation of chondroitin by CHST1 1 plays a crucial role in 
skeletal development and signalling events and in human 
disease, including cancer (reviewed in Kluppel 72 ). Mice homozy- 
gous for a mutation in CHST11 die shortly after birth with 
severe chondrodysplasia, growth plate defects, and accelerated 
chondrocyte differentiation. The reduction in C4S in mutant 
mice leads to changes in the spatial distribution of CS and 
altered patterns of TGF-p and bone morphogenetic protein sig- 
nalling in the cartilage growth plate. 73 
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TP63 

rs 12 10703 6 in intron 12 of TP63 (tumour protein p63) was 
associated with total knee replacement (TKR) in females with 
borderline genome-wide significance (p = 6.71 X 10 -8 , 
OR=1.21, 95% CI 1.13 to 1.29) (table l). 20 The role of p63 
mutations in cancer is well established (reviewed in Muller and 
Vousden 74 ). Recently a polymorphism in this gene has been 
robustly associated with facial morphology in Europeans. 75 p63 
null mice have major defects in their limb, craniofacial and epi- 
thelial development implying a role for this gene in skeletal 
function. 76 77 

FTO 

rs8 044769 within intron 1 of FTO was most highly associated 
with OA in females (p=6.85x 1(T 8 , OR=l.ll, 95% CI 1.07 to 
1.15) (table l). 20 Variation in the FTO (fat mass and obesity 
associated) gene is known to play an important role in suscepti- 
bility to obesity, 78 and rs8044769 is in partial LD (r2>0.6) with 
the reported most highly associated SNP for body mass index 
(BMI). Overweight/obesity is a well established risk factor for 
OA susceptibility and it is also a predictor for OA progression, 
especially of the knee joint and less of the hip joint. 8 79 The 
signal was attenuated after BMI adjustment, suggesting that the 
FTO gene exerts its effect on OA through obesity. 20 

SUPT3H; CDC5L 

The only signal from the arcOGEN GWAS that exhibited the 
highest association in the male OA stratum was from 
rsl0948172 (p=7.92x 1(T 8 , OR=1.14, 95% CI 1.09 to 1.20) 
situated between the CDCSL (CDC5 cell division cycle 5-like) 
and SUPT3H (suppressor of Ty3 homolog) genes with unclear 
roles in OA (table l). 20 However -500 kb away but in the same 
LD block is the RUNX2 (runt related transcription factor 2) 
gene which codes for a multifunctional transcription factor 
essential for osteoblast development and proper bone forma- 
tion. 80 Runx2 controls skeletal development by regulating the 
differentiation of chondrocytes and osteoblasts and the expres- 
sion of many extracellular matrix protein genes during this 
process (reviewed in Komori 81 ). Consistent with its role as a 
master organiser, alterations in RUNX2 expression levels have 
been associated with skeletal diseases in human and mice. 82 83 
RUNX2 has been suggested as a possible biomarker of bone 
metabolism in several forms of arthritis. 84 

NC0A3 

The largest GWAS meta-analysis for OA to date by the 
TREAT-OA consortium (in 11 277 hip OA cases and 67 473 
controls including follow-up studies) established an additional 
variant, rs6094710, located near NCOA3 with p = 7.9xl0" 9 
and OR=1.28 (95% CI 1.18 to 1.39) (table l). 22 This gene is 
expressed in articular cartilage and its expression is significantly 
reduced in OA affected cartilage compared to preserved cartil- 
age from the same joint. 22 

The molecular mechanism by which NCOA3 could cause OA 
is rather unclear. NCOA3 is involved in the co-activation of dif- 
ferent nuclear receptors, such as for steroids (GR and ER), reti- 
noids (RARs and RXRs), thyroid hormone (TRs), vitamin D3 
(VDR) and prostanoids (PPARs). Several of these hormones 
have been implicated in skeletal metabolism and osteoarthritis. 
NCOA3 knockout mice exhibit growth retardation and reduced 
adult body size, but the molecular mechanism responsible for 
this growth retardation remains largely unknown. In female 
mice the reproductive system showed abnormal development 



and oestrogen values were significantly lower than in the wild 
type indicating a putative role of NCOA3 in steroid regula- 
tion. 85 Alternative hypotheses for a causal effect of the NCOA3 
gene in OA are through regulation of the target tissue responses 
to thyroid hormone T3 or through transcriptional regulation in 
mechanotransduction. 22 

DOTL1 

Prompted by the strong association of a variant in the DOT1L 
gene with minimum joint space width (minJSW) at the hip, 86 
the TreatOA consortium and other European studies performed 
recently a large scale meta-analysis across seven OA scans to 
empower the examination of the association of this variant with 
hip OA. In male subjects the C allele of DOT1L rsl2982744 
was found significantly associated with OA (p = 7.8xl0 -9 , 
OR=1.17, 95% CI 1.11 to 1.23) whereas for both genders 
combined the p value was 8.1 xlO -8 (table l). 21 Interestingly, as 
for the GDFS polymorphism, the same DOT1L variant asso- 
ciated with OA has also been associated with height, 87 suggest- 
ing a role in skeletal formation. 

rsl2982744 is in the first intron of DOT1L (DOTl-like, 
histone H3 methyltransferase) gene which encodes an essential 
and dedicated enzyme for Wnt target gene activation in the 
intestine and is required for the expression of genes that require 
high levels of Wnt signalling in drosophila. 88 89 Wnt signalling 
is critical in the formation of cartilage and bone and in the 
development of the synovial joint. In vitro functional analyses 
demonstrated a role for DOT1L in chondrogenesis and the 
protein was found to interact directly with transcription factor 4 
— a transcription factor interacting with p-catenin — suggesting a 
role for this gene in the Wnt/p-catenin signalling cascade in 
developing chondrocytes. 86 

THE IMPORTANCE OF PHENOTYPE DEFINITION 
IN GENETIC STUDIES OF OA 

Studies in other musculoskeletal traits have demonstrated the 
increase in power that can be afforded by investigating quantita- 
tive traits closer to the underlying biological phenotype. For 
example, in the largest GWAS for osteoporosis to date 56 loci 
associated with bone mineral density at genome-wide signifi- 
cance compared to six loci associated with the hard clinical 
outcome of fracture. 90 In contrast to several diseases that can be 
considered as the extreme of the distribution of a physiological 
trait, OA is a highly heterogeneous disease affecting the entire 
joint and is manifested at different or several joint sites (general- 
ised OA). OA is characterised by variable clinical features with 
possibly different genetic aetiologies. Currently OA ascertain- 
ment is based on either radiographically derived or symptomatic 
criteria, or a combination of these. Radiographic definition con- 
siders only pathophysiological joint features scored from radio- 
graphs. Symptomatic definition considers OA cases when both 
radiographic and pathological symptoms such as pain, stiffness, 
and loss of function at joints are present. For radiographic OA 
(ROA) several scoring systems exist, but the most widely used is 
the Kellgren-Lawrence (KL) grading system with grade 2 (defini- 
tive small osteophytes and little/mild joint space narrowing) or 
over being classified as ROA. However, an investigation by the 
TreatOA consortium on phenotype standardisation noted that 
among major cohort studies KL scores are interpreted differ- 
ently, especially for the knee and hip. 91 The great variability of 
disease definition among different studies presents an extra 
source of phenotype heterogeneity. 

There are several arguments for and against using a more 
homogeneous phenotype for OA by examining cases ascertained 
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by TJR only. Pain and disability among subjects undergoing TJR 
are often poorly correlated with radiographic severity and TJR 
candidates show considerable heterogeneity in these symptom- 
atic and radiographic features of OA. 92 93 On the other hand, 
TJR definition for OA has been proposed for randomised clin- 
ical trials (RCTs) as it is the main clinical outcome that is repre- 
sentative of severe symptomatic large joint OA. 94 The familial 
concordance for hip and knee OA is greater in surgically 
defined than in radiographically defined disease. 95-97 In the 
arcOGEN study the authors were able to investigate the effect 
of OA phenotype definition on the strength of association of 
the eight established signals by comparing the results of 
meta-analyses employing TJR only cases as opposed to studying 
all cases (TJR and ROA combined). Four signals (rs6976, 
rs4836732, rs835487, rsl2107036) showed stronger evidence 
for association in the TJR meta-analysis compared to the ana- 
lysis of all cases, despite the considerable decrease in sample size 
and number of studies in the discovery and replication sets. 
Only one signal (rs9350591) was stronger in the meta-analysis 
employing all cases and three signals (rsl0492367, rs8044769, 
rsl0948172) remained relatively unchanged. 20 

Pain, the most common and discomforting symptom for OA, 
is also an important phenotype to study, but the limited studies 
for OA related pain to date have not been able to robustly 
detect any underlying genetic variants (reviewed in Van Meurs 
and Uitterlinden 98 ). 

As with other traits (eg, hypertension and blood pressure 
measurements 99 10 °) it is anticipated that the examination of 
underlying, intermediate traits that together synthesise the 
phenotype of OA but are closer to the biology of the disease 
could be very advantageous in such a heterogeneous disorder. 
An example of this approach is the implication of the DOT1L 
gene in hip OA pathogenesis. The DOTL1 locus was first dis- 
covered significantly associated with cartilage thickness, as mea- 
sured by joint space width on radiographs, in a relatively small 
number of subjects, but did not reach genome-wide significance 
in a well sized case-control analysis for hip. 86 Subsequently and 
upon additional follow-up, large scale, replication efforts in 
several hip OA case-control datasets, the association of DOTL1 
with OA was eventually established with genome-wide 
significance. 21 

There is some evidence that genetic factors influence joint 
morphology, specific anatomic pattern of joint involvement, 
severity, and bone responses in OA at the hip and knee, and so 
these could represent promising endophenotypes to be studied. 
For example, morphological features such as the pistol grip 
deformity (PGD), femoral neck shaft angle (FNSA), the alpha 
angle and the lateral centre edge (LCE) angle have been asso- 
ciated with hip OA and may be under genetic control. 101-103 
Bone responses to hip OA may be classified as atrophic, normo- 
trophic or hypertrophic, with atrophic OA being is a more pro- 
gressive form of OA than hypertrophic OA. 104 In the only 
genetic study for bone response to OA, the risk for definite hip 
OA among siblings was twofold higher in siblings of index parti- 
cipants who had an atrophic pattern of disease than in siblings 
whose index case had any degree of osteophyte. 105 

SITE AND SEX SPECIFIC DIFFERENCES AT OA LOCI 

There is compelling evidence that there are joint specific genetic 
factors contributing to OA aetiology, 106 consistent with the sig- 
nificant differences reported in OA prevalence between different 
skeletal sites. 107 It is thus not surprising that most of the GWAS 
for OA conducted thus far have stratified cases according to OA 



manifested either at the hip or at the knee joint and have identi- 
fied site specific associations. 

Sex differences have also been reported in the prevalence of 
OA, 107 with female sex being an important risk factor for OA. 
Epidemiological studies have suggested that oestrogen loss may 
be accompanied by an increase in the prevalence and incidence 
of knee and hip OA in females, 108 which may partly explain the 
sex differences in the prevalence of OA. Genetic studies that 
have stratified for sex have identified some clear differences. 
The most compelling example is the significantly different 
(p = 0.003) effect size estimate between both sexes at the 
DOTL1 polymorphism (OR=1.17, 95% CI 1.11 to 1.23, 
p=7.8xl0 -9 in males vs OR=1.05, 95% CI 1.00 to 1.10, 
p = 0.04 in females), and some of the loci identified by the 
arcOGEN study (ASTN2, TP63, and FTO significantly asso- 
ciated with OA in females and SUPT3H;CDCSL locus in 
males). 20 21 

TRANSLATIONAL POTENTIAL OF CURRENT FINDINGS 

It is universally accepted that in characterising the genetic aeti- 
ology of common multifactorial diseases that can be ascribed to 
common variation, the GWAS approach has been very fruitful. 
However, because of the modest and small effect sizes exerted 
by the majority of common variants the translational potential 
of GWAS findings has been extensively criticised. Small effect 
sizes, however, should not undermine the biological importance 
of the genetically implicated genes. 

An excellent example is the recent implication of CHST11 in 
OA. CHST11 codes for an enzyme responsible for the forma- 
tion of CS, an important cartilage proteoglycan, with proteogly- 
can modulation being a currently active area of OA therapeutic 
development. CS is used as a symptomatic, slow acting drug for 
OA recommended by the latest OA Research Society 
International (OARSI) treatment guidelines but, despite exten- 
sive trials, evidence for its effectiveness remains controversial. 109 
The implication of CHST1 1 as a risk locus for OA suggests that 
alternate therapeutic approaches targeting the same pathway 
may be clinically beneficial. 

The implication of the PTHLH locus in risk of OA may pave 
the way for exploring recently developed novel anabolic treat- 
ments for osteoporosis (peptide fragments based on parathyroid 
hormone) in the management of OA. 

The genetic association with FTO confirms existing epidemio- 
logical evidence of the interplay between obesity and OA and 
highlights existing clinical recommendations that weight loss 
regimens may offer symptom relief and avoidance. 

Furthermore, the biological insights afforded by the novel 
robust associations represent the largest, though indirect, trans- 
lational contribution of these GWAS findings to OA. 

FUTURE STUDIES IN OA 

In line with other common complex diseases, all OA associated 
variants thus far collectively explain only a small fraction 
(<10%) of the genetic component. There are possibly several 
more common variants to be discovered for OA through larger 
scale meta-analytical efforts, 23 but also low frequency and rare 
variants, structural variants, gene-environment interactions, and 
epigenetic changes are likely to contribute substantially towards 
this missing heritability. 110 

As the new era of next generation sequencing (NGS) associ- 
ation studies is emerging, the field of complex disease genetics is 
now focusing on the contribution of low minor allele frequency 
(MAF 1-5%) and rare variants (MAF <1%). Such variants may 
have larger effect sizes, higher penetrance, and point to causal 
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genes or functional units (eg, regulatory regions) more readily. 
Studies of rare variation in OA are currently underway. 

The study of less heterogeneous, narrower OA endopheno- 
types closer to the biology of the disease is likely to lead to 
many more common and low frequency/rare variants underpin- 
ning specific and clinically relevant processes of disease develop- 
ment and progression. In addition, large scale studies 
investigating interactions between genetic and environmental 
risk factors can conceivably help shape approaches of disease 
management. Ultimately coupling all these genetic variants to 
function through functional studies and by integration with data 
generated from transcriptomics — the study of gene expression — 
and epigenomics — the study of epigenetic modifications such as 
DNA methylation, histone modifications, etc in the control of 
gene expression — will shape future genomics research in OA. 

SUMMARY 

Over the past few years GWAS in individuals of European and 
Asian ethnicity have collectively robustly identified 15 OA asso- 
ciated variants with genome-wide significance. All of the var- 
iants that have been detected thus far are common in frequency 
— which is by definition what GWAS are designed for — and 
appear to confer small to modest effect sizes. Fine mapping is 
required to identify which are the causal variants at the estab- 
lished loci. In addition, functional work is required to establish 
the causal genes, particularly for the loci that encompass many 
genes in regions of extended linkage disequilibrium. Despite 
these limitations the research that has been carried out thus far 
has provided insights into the biological processes that underlie 
OA susceptibility and has revealed some candidates with transla- 
tional potential. The future outlook for OA genetics appears 
likely to be shaped by larger meta-analytical efforts to identify 
additional susceptibility loci, NGS approaches that can interro- 
gate low frequency and rare variation, expanded and tighter OA 
phenotype definitions, and the integration of genetic variation 
studies with epigenetics and transcriptomics. 
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